Multilingual Query Expansion for CLEF Adhoc-TEL

Author

  • Ray R. Larson
Abstract

In this paper we briefly describe the approaches taken by the Cheshire (Berkeley) Group for the CLEF Adhoc-TEL 2009 tasks (monolingual and bilingual retrieval). Recognizing that many potentially relevant documents in each of the TEL subcollections are in other languages, we tried searching each subcollection with multiple translations of the topics, combined into a single query. Overall, this strategy performed very poorly compared to the basic monolingual approach used last year (and repeated for one run in each language this year). We have not yet completed our analysis of the reasons for this; we suspect that results were evaluated on the expectation that retrieved items would be in the same language as the topic. Once again this year we used probabilistic text retrieval based on logistic regression, incorporating blind relevance feedback, for all of the runs. All translation for the bilingual tasks was performed using the LEC Power Translator PC-based MT system. Our results this year, however, were surprisingly poor compared to last year's results. Some testing has shown that, in some cases, unexpected hyphenations in the machine translation output and untranslated words were to blame. It may also be the case that other participants have significantly improved their approaches for this task.
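The idea of combining several translations of a topic into one query can be illustrated with a minimal sketch. This is not the Cheshire system's implementation: the function names, the whitespace tokenizer, and the sample topic text are all hypothetical, and the ranking step (logistic regression with blind relevance feedback) is left out entirely.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for a real tokenizer)."""
    return text.lower().split()


def combine_translations(translations):
    """Merge all translations of one topic into a single bag-of-words query.

    `translations` maps a language code to the topic text in that language.
    Terms that survive translation unchanged (e.g. proper nouns) appear in
    several versions and therefore accumulate a higher count.
    """
    query = Counter()
    for lang in sorted(translations):  # fixed order for reproducibility
        query.update(tokenize(translations[lang]))
    return query


# Invented example topic, not taken from the CLEF topic set.
topic = {
    "en": "medieval manuscripts Paris",
    "fr": "manuscrits médiévaux Paris",
    "de": "mittelalterliche Handschriften Paris",
}
combined = combine_translations(topic)
print(combined)
```

In a sketch like this, an untranslated or oddly hyphenated word from the MT output would simply enter the query as an extra term, which is one way such artifacts could dilute a combined query.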

Similar resources

Evaluating Cross-Language Explicit Semantic Analysis and Cross Querying at TEL@CLEF 2009

This paper describes our participation in the TEL@CLEF task of the CLEF 2009 adhoc track. The task is to retrieve items from various multilingual collections of library catalog records, which are relevant to a user’s query. Two different strategies are employed: (i) the Cross-Language Explicit Semantic Analysis, CL-ESA, where the library catalog records and the queries are represented in a mult...

CACAO Project at the TEL@CLEF 2008 Task

The paper describes the participation of the CACAO project consortium in the TEL@CLEF 2008 task, targeted at retrieving relevant items from collections of library catalogues. CACAO proposes the development of an infrastructure for multilingual access to digital content, including an information retrieval system able to search for books and texts in all the available languages. For each monolingu...

Dictionary-based CLIR for the CLEF Multilingual Track

This report describes the work done for our participation in the multilingual track of the Cross-Language Evaluation Forum (CLEF). We use a dictionary-based approach to translate English queries into German, French and Italian queries. We then apply a term disambiguation technique to select the best translation terms from the terms found in the dictionary entries, and a query expansion technique...

The University of Indonesia's Participation in IMAGE-CLEF 2005

We present a report on our participation in the English-Indonesian image adhoc task of the 2005 Cross-Language Evaluation Forum (CLEF). We chose to translate the Indonesian query set into English using a commercial machine translation tool called Transtool. We show that some improvement in retrieval effectiveness can be obtained using a query expansion technique. We used an approach that combin...

Technical University of Lisbon CLEF 2008 Submission: TEL@CLEF Monolingual Task

We describe our participation in the TEL@CLEF task of the CLEF 2008 ad-hoc track, where we measured the retrieval performance of the IR service that is currently under development as part of the DIGMAP project. DIGMAP’s IR service is mostly based on Lucene, together with extensions for using query expansion and multinomial language modelling. In our runs, we experimented with combinations of query e...

Publication date: 2009